On the reliability of polarization estimation using Rotation Measure Synthesis 

J.-P. Macquart\ R.D. Ekers^'^, I. Feain^ and M. Johnston-Hollitt^ 

"^" ; ABSTRACT 

CN I We benchmark the rehabihty of the Rotation Measure (RM) synthesis algorithm 

using the 1005 Centaurus A field sources of Feain et al. (2009). The RM synthesis 
solutions are compared with estimates of the polarization parameters using traditional 
methods. This analysis provides verification of the reliability of RM synthesis estimates. 
We show that estimates of the polarization parameters can be made at lower S/N if 
the range of RMs is bounded, but reliable estimates of individual sources with unusual 
RMs require unconstrainted solutions and higher S/N. 

We derive from first principles the statistical properties of the polarization ampli- 

r^ [ tude associated with RM synthesis in the presence of noise. The amplitude distribution 

depends explicitly on the amplitude of the underlying (intrinsic) polarization signal. 
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Hence it is necessary to model the underlying polarization signal distribution in order 



c/5 ■ to estimate the reliability and errors in polarization parameter estimates. We intro- 

duce a Bayesian method to derive the distribution of intrinsic amplitudes based on the 
distribution of measured amplitudes. 
^ i The theoretically-derived distribution is compared with the empirical data to provide 

^^ I quantitative estimates of the probability that an RM synthesis solution is correct as a 

t^^ I function of S/N. We provide quantitative estimates of the probability that any given 

^ ' RM synthesis solution is correct as a function of measured polarized amplitude and the 

^L) • intrinsic polarization amplitude compared to the noise. 

(N 

Subject headings: techniques: polarimetric - galaxies: individual (Centaurus A, NGC 

5128) 
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1. Introduction 

The birefringence of the magnetized plasma that pervades interstellar and intergalactic space 
causes the observed linear polarization properties of most astrophysical radio sources to be strongly 
frequency dependent. In cold plasmas the effect of Faraday rotation causes the position angle of the 
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linear polarization vector to increase by an amount A^RM. The magnitude of the rotation measure 
(RM) ranges between ^^ lOradm"^ for typical sources observed through the interstellar medium 
of our Galaxy (Johnston-Hollitt, Hollitt & Ekers 2004; Schnitzeler 2010) to ~ lO^radm'^ for the 
cores of some radio sources to as high as ~ 5 x 10^ rad m~^ for the compact radio source at the 
centre of the Milky Way, Sgr A* (Macquart et al. 2006, Maronne et al. 2007). 

It is often desirable to derive the rotation measure along a particular line of sight and recover 
the linear polarization vector intrinsic to the radio source at the point of emission. This requires ob- 
servations over a sufficiently large frequency range that the effect of Faraday rotation is measurable 
but with fine enough spectral resolution that Faraday rotation does not cause appreciable rotation 
of the polarization vector across individual spectral channels (bandwidth smearing). However, in 
many situations of astrophysical interest this trade-off results in a situation where the signal-to- 
noise ratio of the polarization measurement in each spectral channel is too low to compute either 
the rotation measure or intrinsic polarization vector without employing a method that simultane- 
ously utilizes the measurements across the entire spectral band. This is particularly pertinent to 
observations made with next-generation backends, such as that of the EVLA (Perley et al. 2011) 
and the Compact Array Broadband Backend (Wilson et al. 2011). 

The technique of rotation measure synthesis was developed by (Killeen et al. 1999) to search 
for polarization from Sgr A* in situations where the large expected RM required such high frequency 
resolution that the signal-to-noise ratio (S/N) per channel was too low. This technique, which was 
already implicit in the treatment of Burn (1966), was independently discovered by Brentjens & de 
Bruyn (2005) to handle the large Galactic rotations seen at low radio frequency. Over the past 
few years RM synthesis has emerged as the method of choice for the derivation of Faraday rotation 
measures from radio polarimetric data. RM synthesis utilizes both the amplitude and position angle 
of the polarization vector at all observed frequencies, and is capable of disentangling emission at 
multiple rotation measures (or Faraday depths) combined within the telescope beam. Because all 
the observed frequency dependent information is used without bandwidth smearing and without the 
nonlinear operations needed to calculate the variation in polarization position angle with frequency, 
it should also be the optimum method to extract other polarization information from noisy data. 
This technique is fundamentally different from previous methods which derived RMs by fitting 
position angle estimates at a usually small number of often widely spaced frequencies. 

Despite its extensive current use, a number of questions remain concerning the practical ap- 
plication of RM synthesis to real datasets. This includes debate on the confidence of the results of 
the technique as a function of S/N (e.g. Law et al. 2011), and on the reliability of the technique 
when there are two emission regions with closely-spaced RM and polarization vectors within the 
same telescope beam (Farnsworth, Rudnick &: Brown 2011). There is also confusion regarding the 
appropriate means of treating the effects of polarization bias, associated with the fact that the mean 
magnitude of the polarization vector is nonzero even if no polarized signal is present (Simmons &: 
Stewart 1985). 



The objective of this paper is to resolve these issues. The technique and nomenclature of 
RM synthesis is summarized in ^ In in ^ we proceed to examine several specific facets of the 
performance of rotation measure synthesis in detail. We utilize the 1005 source dataset of a field 
surrounding Centaurus A (Feain et al. 2009) to examine its performance in estimating polarization 
parameters. 

In ^ we derive the distribution of polarization amplitudes expected from RM synthesis data 
as a function of the amplitude of the input polarization signal. We use this to demonstrate the 
derivation of the intrinsic distribution of polarization amplitudes within a sample. This is applied 
to the Cen A dataset to examine the polarization flux density distribution down to the sub-mJy 
level; this presents a means of measuring the polarization distribution to well below the detection 
threshold of any given source, in a manner somwhat akin to that of "stacking" used in other surveys 
to achieve statistical detections of populations at detection thresholds below that of an individual 
source. 

In ^we examine the Cen A dataset for sources with unusual polarization behaviour, including 
sources with extreme or multiple RMs. Analysis of the changes in the goodness of the fit (as 
quantified by the reduced-x^ statistic) is used to identify the small number of sources with multiple 
RMs. The conclusions are presented in ^ 



2. RM synthesis: terminology and summary 

For the purposes of the following sections it is useful to briefiy recapitulate how RM synthesis 
determines the rotation measure and polarization of a source subject to Faraday rotation. The input 
data for RM synthesis is a stream of Q and V Stokes parameters, Qi,Q2, ■ ■ ■ , Qn and Ui,U2, ■ ■ ■ , Un 
where there are A^ spectral channels corresponding to wavelengths Ai, . . . Ajv- The polarization data 
can be equivalently represented as a series of complex numbers where Vj = Qj + iUj = Pje^^K 
(Throughout the text we use V to denote the complex polarization and P = \'P\ to denote its 
amplitude.) RM synthesis works by winding up the measurements of Qj and Uj for a range of 
different trial rotation measures, (j). If the weights corresponding to the spectral channels are 
denoted Wj, then the approximation to the Faraday dispersion function is 

N 

F{(t>) = irJ^P,e-2*'^(^'-^o), (1) 

(2) 

where Pk = WkVk and it is convenient (but not necessary) to choose Aq as the weighted average of the 
channel values of A^ over the observing band, i.e. Ag = J2k=i ^fc-^l/ Ylk=i ^fc- "^^^ approximation 
to the Faraday dispersion function is the convolution of the actual Faraday dispersion function. 
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R{<P) = lj;e-2^*(^-^o). (6) 



F{(j)), with a function, R{(j)), which describes the sensitivity of an observation to emission at each 
Faraday depth, (p: 

F{<P) = F{(P)^R{^). (3) 

The function R{(j)) is known as the rotation measure transfer function (RMTF) and is computed 
as fohows: 

AT 
i?(0) = i^^u;^.e-2^^(^'-^o). (4) 

In the simple case in which ah channel weights are equal, the above relations simplify to 

N 
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Now suppose a source contains polarization emission at one or more Faraday depths, (pk, and 
associated with each depth is a polarization vector "Psrc.k- It is clear from eq. ([5]) that for a dataset 
of polarization measurements for a source with Faraday depths (/>o , </>i , • • • the sum over channels 
results in a coherent summing of the polarization vectors, and the amplitude of the function F((f>) 
will be large. The tendency for this sum to add coherently for values of (pk that match (i.e. correctly 
de-rotate) the polarization data means that |-F((/>)| peaks at the locations of the rotation measure(s) 
in the source, with the quantity |-F((/)fc)| = jPsrcjijI giving the amplitude of the polarization vector 
that is rotating at each Faraday depth, 0^. Thus the problem of measuring the Faraday depths and 
rotation polarization vectors that are significant in a given dataset is one of locating those peaks 
in the function |-F((/))| that exceed some given threshold. 



2.1. RM synthesis deconvolution implementation (CLEAN) 

It is often necessary to recover the Faraday dispersion function, F, by deconvolving F from 
the RMTF, particularly if the source contains emission at multiple Faraday depths and if the S/N 
of the observations is higlo. To this end, we implemented a variation of the CLEAN algorithm 
(Hogbom 1974; Heald, Braun & Edmonds 2009) to handle the fact that both F and the RMTF 
are complex- valued. It searches for the highest peak in \F\ over a specified range of (j) and removes 
a suitably scaled copy of the RMTF centred on its corresponding Faraday depth. The algorithm 
iterated over successive peaks until either a maximum number of allowed iterations was reached 



^When the S/N is sufficient, this enables us to "super-resolve" F and search for multiple peaks in cj) a, scale smaller 
than the width of R. 



or the rms of the fluctuations in F in the search region matched the rms of F exterior to the 
search region. We used a loop gain of 7 = 1.0 in the CLEAN algorithm. Lower loop gain values, 
as normally used in 2D image deconvolution, were trialled but it was found that since these stop 
recovering the polarization signal once it reaches the noise floor; solutions in which 7 < 1 are used 
systematically underestimate the polarization amplitude by at least la. 

The RM synthesis code was supplemented by several additional features. Because the functions 
F and R were computed on a grid of Faraday depths much finer than the width of R, it is sometimes 
desirable to group together CLEAN components that are very tightly clustered together and identify 
them as members of a single component. By grouping components in this way we could obtain the 
X^ estimate for each independent component. We introduced a Faraday depth bunching tolerance, 
A(/), such that any two components with depths (jjj and (j)k were merged if |(/>j — </>fc| < A.(f). A 
tolerance ^(j) = 5 rad m~^ was found to be suitable for the dataset examined in the present paper. 

The code output a list of clean components ranked in order of amplitude, together with their 
corresponding Faraday depth and polarization vector Vi evaluated at the mean square wavelength 
Aq. The code also computed the reduced x^ associated with the polarization solution as a function 
of the number of clean components used. To be specific, for measurements with standard deviations 
ag and au in Q and U respectively, and clean components with Faraday depths cpd, (/'c2) • • • s-i^d 
associated polarization vectors Vci,'Pc2: ■ ■ ■: the reduced x^ associated with a fit to the data by 
including the first n clean components is. 



2 _ 1 V^ [Qi - Q{<Pcl, ■■■, (pcn'^Vcl, ■ ■ ■,'Pcn; -^i)] [Uj - Ujcpcl, ■ ■ ■ , 4'cni'Pcl, ■ ■ ■ ,^0^, >^i)] /^n 



where 

n 

g(0cl, . . ■Acn,'Pcl, ■ ■ ■,Vcn; Afc) + iU i^cl, ■ ■ ■ Acn,'Pcl, ■ ■ ■ .Vcn. Afc) = J]] P.j-e^^^''^^^ . (8) 

i=i 

This x^ statistic was found to be an excellent estimator of the number of significant clean compo- 
nents required to model the polarization behaviour of each source. It was also useful to identify 
cases in which RM synthesis solution differed significantly from the polarization behaviour of the 
source; this highlighted cases in which the polarization is not noise-like but where its behaviour 
is chiefiy governed by a mechanism other than Faraday rotation (^3 discusses specific examples of 
this). 



3. RM synthesis of Centaurus A data 

In this section we compare the performance of RM synthesis against other techniques of deter- 
mining the polarization and rotation measures of radio sources. As an initial step, we investigate 
the accuracy of the rotation measures reported by RM synthesis as a function of signal-to-noise 



ratio. We then compare the rotation measures derived by RM synthesis against a technique that 
mimics the technique used to derive the rotation measures extracted from the NRAO VLA Sky 
Survey (Condon et al. 1998; Taylor, Stil &: Sunstrum 2009) based on measurements of the polariza- 
tion in two closely separated bands at 1.4 GHz. We then compare the performance of RM synthesis 
against a more sophisticated technique that performs a least-squares fit to the polarization data. 

This analysis is based on the ATCA observations of 1005 background sources contained in 
a ~ 34deg^ field centred on the radio galaxy Centaurus A (Feain et al. 2009). The data have a 
bandwidth of 192 MHz divided into 24 x 8 MHz channels between 1296 and 1480 MHz. Because 
of the effect of the reweight option in Miriad a 5% correlation was introduced between channels 
resulting in 22.75 independent channels. The total bandwidth and spectral resolution of these data 
mean that the FWHM in Faraday rotation for a single polarization component is 280 rad m~^, 
and that it is sensitive to RMs with magnitudes less than ~ 3500 rad m~^. The relatively low RM 
resolution in these data avoids the potential complexity in the interpretation of any sources with 
closely-spaced multiple RMs, as discussed by Farnsworth, Rudnick & Brown (2011). We allowed 
our RM synthesis code to search to RMs of ±4000 rad m~^. The average root-mean-square error 
in U and Q is 0.089 mJy and the corresponding average sigma in P (which is the measure of noise 
used throughout this paper) is 0.12 mJy. 



3.1. Confidence versus S/N based on RM synthesis-derived parameters 

We performed RM synthesis on the 1005 sources in the Cen A field to investigate the reliability 
of the derived Faraday depths as a function of S/N ratio. Several recent papers adopt a S/N of 7 
as the threshold below which the results of RM synthesis are regarded as unreliable and are not 
reported (Feain et al. 2009; Law et al. 2011). It is therefore propitious to investigate whether this 
threshold represents a sharp barrier to the believability of RM synthesis results, or whether results 
below this level only decline gradually in accuracy. The large size of our sample enables us to 
empirically derive the level of confidence associated with RM synthesis solutions over a large range 
of S/N ratios. 

Figures [T] & [2] show the distribution of derived RMs for varying S/N thresholds. For the 
purposes of this analysis we have used only the single most significant Faraday depth derived by 
RM synthesis; the small number of sources in which there are multiple significant RMs has a 
negligible influence on the statistics (see ^. The signal was derived from the value of |-F| at the 
most significant Faraday depth. Some RFI had to be flagged in the time based data resulting a 
small variations in S/N between channels. This also varied from source to source. The noise, a, is 
the taken to be the average rms noise per channel for each source divided by the square-root of the 
weighted number of channels used. To simplify the theoretical analysis in the following sections 
we have assumed equal noise per channel, but it was then essential to correct for the effect of 
this non-uniform noise in the data by including the non-uniform weighting in the S/N estimates 
and in the estimates of the number of independent channels. We estimate this average rms noise 



per channel from the observations using the fact that the rms errors add in quadrature. Feain et 
al. (2009) provides specifics on the estimation of errors for the 8MHz Q and U channels which were 
interpolated from the correlator output. The effect of RFI flagging which resulted in non- uniform 
noise varying from source to source had to be included in the analysis. Even though these effects 
are small and subtle it is essential to account for them when analysing the polarization statistics. 

It is evident that all the observed RMs for high S/N detections are in the range —150 to 
+50radm~^, and we assume that the lower S/N detections are drawn from the same distribution. 
Polarization detections with RMs outside this range are most likely to be due to noise. At the 
lower S/N regime of interest here, the expected distribution is broadened by noise so we increase 
the false RM range by 2a and use the range —210 to +110radm~^ which is appropriate down to 
signal strengths of 5cr. If Faraday depths associated with false detections are evenly distributed 
over the depth search range - as appears to be the case based on Figures [T]& [2]- there is only a 4% 
chance that any given Faraday depth found by RM synthesis will fall in the range of expected real 
detections by accident. Table [1] gives the distribution of "true" and "false" polarization detections 
as a function of S/N. 

Some RMs outside this range could be real (see ^4.5p and while this does not significantly 
affect the statistical analysis, individual sources with high intrinsic RM are of interest (see ^. 
This population is rare in the flux density range of our sample since only one of the 309 sources 
detected above 60" possesses an intrinsic RM magnitude larger than 200radm~^. 

S/N > 7cr >6ct > So- >4cr > 3o- > 2a ^H 

Approx P(mJy) > 0.63 > 0.54 > 0.45 > 0.36 > 0.27 > 0.18 

Total number 

Number with good RM 

Number with false RM 

Differential number 

Differential number with 

good RM 

Differential number with 1 5 59 211 200 

false RM 

Table 1: The distribution of polarization detections. 

The one high RM source in the 6<S/N< 7 bin, 132446—460218, merits comment. This source 
has a clear double peak in the RM synthesis spectrum. The two peaks have nearly equal polar- 
ization: one is at 297radm^^ but the second at — 59radm~^ does fall within the expected RM 
range. The S/N difference in amplitude is < 10%. As discussed at the end of §5, the statistical 
treatment of a second RM component in the presence of a significant polarized signal is well beyond 
the analysis in this paper and this may be a correct but rare detection. 
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3.2. Comparison with traditional RM methods 

The traditional method to determine the RM has been to determine the position angle of 
polarization in each frequency channel and then to make a scalar least squares fit to the position 
angles. Figure [3] (top) shows the result of this analysis when applied to sources in the Cen A 
polarization catalogue. When compared to the RM synthesis solution for the same sources (bottom 
panel in Figure [3]) it can be seen that a small number of spurious high RM values are obtained. 
Only those sources with integrated polarization signals S/N>7, as identified by RM synthesis, were 
used in the analysis. It is evident that there is a tail in the distribution of the Faraday depths 
derived by fitting polarization position angles. Of the 251 sources with S/N>7 used in the analysis, 
16 of these fall outside the range [—210, 110] radm^^. This can be explained by noting that with 
the lower S/N in each channel the least squares fit for weak sources can find solutions which are 
randomly distributed over the RM space included in the least square solution. This high RM 
tail is reminiscent of the tail of high RMs sometimes seen in existing RM catalogues e.g. see the 
comparison of the NVSS catalogs and the Kronberg &: Newton-McGee compilation in Pshirkov et 
al. (2011). The small number of genuine high RMs seen in our dataset (< 1%) indicates that this 
class of radio source is rare in surveys at 1.4 GHz, presumably because sources with high intrinsic 
RM are more likely to be strongly depolarized. 

Taylor et al. (2009) report the rotation measures of 37,543 sources from the NVSS based 
on VLA snapshot images in two 42 MHz wide bands centred on 1364.9 MHz and 1435.1MHz. It 
is opportune to investigate the accuracy of this catalog given that it currently represents one of 
the largest resources of rotation measures, and is rapidly being adopted as a de facto standard in 
analyses of the Galactic and extragalactic magnetic fields (Harvey-Smith, Madsen & Gaensler 2011; 
Law et al. 2011; Opperman et al. 2011; Govoni et al. 2010; McClure-Griffiths et al. 2010; Schnitzeler 
2010; Stasyszyn et al.2010). 

We used the Cen A polarization catalog to investigate the accuracy of this technique used to 
derive NVSS rotation measures. The data were obtained at a comparable frequency, and we have 
collapsed them into two spectral channels with similar centre frequencies of 1364 and 1436 MHz. 
As for the NVSS sample, the range of Faraday depths in our sample is sufficiently small that 
phase wrap ambiguities will not occucl- The RM results for the two-band measurements are in full 
agreement with the RM synthesis results. The agreement with the two-band RM synthesis results 
is unsurprising given that the bands are contiguous. 



^In the NVSS, the actual rotation measure of a source is related to the deduced value assuming no phase wraps, 
RMo, by the relation, RM = RMo + (681radm~^) m, where m represents the number of integral phase wraps that 
occurred between the two observing frequencies. Taylor et al. (2009) assume \m\ < 1, given that RMs of the magnitude 
required to effect a phase wrap are rare when the source is located off the Galactic plane and away from the inner 
Galaxy. 



3.3. Comparison of RM synthesis and least squares fits 

Although RM synthesis outperforms traditional frequency-channel-based position-angle fitting 
algorithms such as the ones discussed above, it remains to be seen whether it makes the best use of 
the polarization information available. We address this by considering the output of a brute force 
minimum least squares search for detected polarization at any RM on all the sources, subject to 
the assumption that the source contains only a single polarized component that is assumed to have 
a flat spectrum. To be specific, we implemented a code that seeks the parameters P, xo and (j) 
(polarization amplitude, position angle and Faraday depth) that minimize the penalty function, 



X 



2_ 1 ^[Qj-Re[pj{P,xoA)? , [Uj-Im[pj{P,xoA)? 



^2 + T2^ ' (9) 



2N-3^ a^Q at; 



where the model for the polarization of the jth channel is, 

Pj{P,Xo,(t)) = PeMHxo + X]<P)]- (10) 

Figure H] compares the solutions from the RM synthesis with the brute force least squares fit 
approach. The two methods agree very well in both RM and polarization amplitude. At high S/N 
the solutions are identical. The increased scatter at lower S/N is not surprising as the two methods 
find different solutions, but the gap (along the Faraday depth axis) surrounding the identical 
solution indicates that both methods converge to the identical solution if the solutions are close. 
There is no systematic change in the ratio of signal amplitude between the two methods with S/N, 
and the average ratio for the two methods is 1.0. 

The behaviour of the solutions for the sources with a "false" detection based on the RM has 
a number of undesirable properties. The brute force solution has a much narrower distribution of 
RMs at low S/N (it 500 rad m~^). The amplitudes of the bad solutions are lower and the reduced 
X^ worse indicating that it does not find the global minimum for noise-dominated signals. 

In i j4.2l it is demonstrated that the noise does fall as A^~^' ^ for RM synthesis. The naive 
expectation would be that it falls similarly for traditional methods, since the estimated standard 
deviation in the slope of a regression line scales as N~^''^ . However, in practice this dependence is 
more complicated for traditional methods in a way that is hard to analyse; nvr phase ambiguities 
in the position angle introduce additional degrees of freedom into the fitting problem, and the fit 
error depends in detail on how the channels are spaced. 



4. The statistics of polarized sources 

Determination of the distribution of polarization amplitudes intrinsic to a set of sources involves 
disentangling the contribution of noise from the polarization amplitudes deduced by RM synthesis. 
This is most important for sources whose intrinsic polarization is near the noise level. 
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We are motivated to examine the noise statistics in detail with a view to accounting for its 
effects in the distribution of polarization amplitudes measured in a set of sources. Specifically, 
the objective is to evaluate the quantity p{P\Pq), namely the probability of obtaining an observed 
polarization amplitude P conditioned on the amplitude of the true intrinsic polarization, Pq. This 
distribution function is the basis for deriving the probability distribution of intrinsic (i.e. "noise- 
corrected") polarization amplitudes based on a set of measured polarization amplitudes determined 
by RM synthesis. We conclude this section by applying the formalism to investigate the distribution 
of polarization amplitudes in the Cen A dataset at low polarization amplitude levels. 



4.1. The polarization distribution of noise 

Here we derive the distribution of polarization amplitudes recovered by RM synthesis from 
pure noise. The input measurements Qj and Uj are therefore assumed to possess the properties of 
thermal noise, and are independent and normally distributed with zero mean and noise per spectral 
channel a. 

Recall that RM synthesis derotates the channelized measurements of Q and U for a range of 
different trial rotation measures, <J3. The resultant summed (complex) number is 

r = ±Y,Pje'^'e^'^''^. (11) 

j 

The polarization vector derived by RM synthesis is the value of V for which the choice of <j) gives a 
maximum in the value of P. The task here is to determine the distribution of the amplitude of V. 
If (p is given it is straightforward to recover the polarization distribution. However, an additional 
complication arises because the RM synthesis algorithm is allowed the extra degree of freedom to 
chose any value of (p which maximises P. The freedom to rotate the measured polarization vectors 
over a range of trial values of 4> ensures that the polarization vector amplitude returned by RM 
synthesis is at least as large as that found in the (j) = case. This is because the algorithm may 
derotate the measured polarization vectors with any value of (p which improves the coherence of 
the sum in eq. (jlip . 



There is a direct analogy between the noise characteristics of RM synthesis and those encoun- 
tered in interferometric radio imaging, which involves deriving the distribution of noise amplitudes 
from (complex) visibilities (see Section 9.3 of Thomson, Moran & Svenson 2001). In both cases one 
is attempting to "wind-up" complex valued quantities whose real and imaginary components are 
both normally distributed. In the case of RM synthesis, they are wound up for various values of 
Faraday depth. In the interferometric imaging case the complex-valued interferometric visibilites 
are searched across a range of fringe rotation values, corresponding to the possible position of the 
source. Thus the two treatments are mathematically identical. We elaborate on this connection in 
the analysis below. 
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4.1.1. Constant (j) 

First consider the simple, well-known case in which we exphcitly hold the rotation measure 
at a fixed value. Since the input data is purely noise-like the actual value of (j) is unimportant: 
any noise-like polarization vector which is transformed by derotation to any nonzero value of (j) 
still possesses the same statistical noise-like properties. Here, for the purpose of simplicity, we set 
(^ = 0. The polarization found by RM synthesis is, 



^0=0 = ^ 



N N 



(12) 



The statistics of the sum of N independent, normally distributed random variates is also normally 
distributed, with standard deviation aQ^jjy^ ■ Thus Sq = N~^ ^ • Qj also follows a normal 



distribution with zero mean and standard deviation aq^u/vN. An identical relation holds for 
Su = N~'^ ^ • Uj. It is then straightforward to show that the probability distribution for the 
polarization amplitude when there is no signal present and (/> = is just a Rayleigh distribution, 

pp{P)dP = ^ exp (-^) dP, (13) 

where we define S^ = a"^ /N . This result is well known in the analogous interferometric imaging 
case pertaining to noise in VLBl observations, where it corresponds to the statistics of the fringe 
amplitude for a specific fringe rotation when no signal is present (Thomson, Moran & Swenson 
2001, p318). 



4.1.2. Unconstrained <f> 

Now consider the general case in which RM synthesis performs a fit to the polarization data 
that is unconstrained in Faraday depth, (p. This consists of two separate steps. The first is to find 
the specific value of (p which maximizes the value of P for a given dataset. The second step is to 
find the distribution of P corresponding to these maxima. 

The square of the polarization amplitude is 

N 

k,m 

N 

= J^PfcP„^cos[V'fc-^™ + 20(Al-A^)]. (14) 

k,m 

The second line follows because the sine function has odd symmetry, so the imaginary part of every 
pair k, m in the sum is cancelled by every pair m, k. 



12 



The rotation measure (j) should obey the followmg condition at the maximal value of P, 

N 



d{VV*) 



Y, 2^(Al - \l,)PkPme'^^^-^'-h^''^^^l-^^\ (15) 

k,m 



Since the imaginary terms are antisymmetric with respect to exchange of the indices k and m, we 
see that imaginary part of this sum is zero for any value of (j). So only the real part of the sum 
yields a nontrivial constraint on cj): 



N 



= E(^' - A^)PfcP™sin[Vfc -^m + 2^{Xl - Xl)]. (16) 

k,m 

The foregoing relations show that the distribution of V exhibits a detailed dependence on the 
spectral coverage (i.e. the values of Xf.). This indicates that the exact location and amplitude of 
the noise peak depends on the A|, and that the noise distribution for any given spectral setup is 
best found empirically. 

It is possible to construct a simple approximate analytic formulation independent of the de- 
tailed spectral coverage. This result has already been derived in the analogous case of the noise 
properties in VLBI images, where one is concerned with the statistics of the fringe amplitude across 
(Thomson, Moran & Swenson 2001, pp319-211). We briefly summarize the argument here and place 
it in the context of RM synthesis. 

The statistics of the polarization amplitude at any one value of (p obey a Rayleigh distribution. 
At any given value of (p the probability that we detect a polarization amplitude lower than some 
prescribed threshold amplitude, say Z, is the cumulative probability distribution corresponding to 
eq. (113p . namely, p{Z < P) = 1 — exp (— Z^/2S^). If the RM synthesis search is conducted over 
a large range of possible Faraday depths, —cpoKcpKcpo, and the width of the amplitude of the 
RMTF is approximately A^, there are A'" ~ Icpo/Acp independent Faraday depth "pixels", or trials, 
over which the RM search is being performed. The probability that we do not detect a polarization 
amplitude above some amplitude Z after these N trials is p^ . The probability of finding Z < \V\ 
is just the culmulative probability distribution function of the amplitude of the polarization vector. 



CDF„(Z) 



1 — exp 



Z2..N 



2S2 



(17) 



where the subscript n emphasises that the distribution is associated with a purely noise-like signal. 
The corresponding probability density function is. 






. Z 

1 — exp 



2S2 



N-l ( Z'^\ 

exp(-^j. (18) 



This expression describes the expected distribution of the polarization amplitude when the input 
data are purely noise-like and in which the rotation measure is searched over approximately A^ 
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independent Faraday depth pixels. It is identical the characteristics of the noise in a search for 
fringes from A^ independent samples (Thomson, Moran & Swenson 2001, eq 9.57). The exact noise 
distribution pertaining to the Cen A dataset is discussed at the conclusion of this section. We pre- 
empt this by noting that we find empirically that the noise distribution for the Cen A observations 
(shown in Fig. [6]) differs slightly from the analytic formula derived here. 

The behaviour of the noise distribution has a number of important implications for the statistics 
of polarization amplitudes determined by RM synthesis. Equation (jlSp shows that the distribution 
of polarization amplitudes derived from noise narrows as A^ increases (for constant S), and the 
peak of the distribution shifts to increasing values of P/S. The location of the peak is determined 
by the solution to the transcendental equation, 

p2 [N - exp(PV2S^)] = S^ [1 - exp(PV2S^)] . (19) 

This has solutions P/S = 1.93, 2.24, 2.84, 3.07 and 3.54 respectively for iV = 5, 10, 50, 100 and 500. 

Thus the threshold signal-to- noise, P/S, for the believability of a polarization signal extracted 
using RM synthesis exhibits an important, albeit weak, dependence on the number of independent 
trials, A^. The distribution peaks sharply on the low P/S side of the distribution, implying that one 
expects very few polarization amplitudes with signal-to-noise values less than 2.0 once A'^ exceeds 
10. For the analysis in this paper A^ = 30. 



4.2. The polarization distribution of a nonzero polarization signal containing noise 

There is an important distinction between the polarization distribution obtained from a purely 
noise-like signal from RM synthesis and the polarization distribution derived when an intrinsic 
(nonzero) polarization signal is present. The distinction arises because, in the absence of an intrinsic 
polarization signal, the RM synthesis algorithm picks out the polarization vector associated with 
the Faraday depth that maximises the length of the "Faraday-derotated" noise vector. However, 
this freedom to optimise the length of the polarization noise vector is removed provided the intrinsic 
signal is sufficiently large that the RM synthesis code finds the highest amplitude at the correct 
Faraday depth of the intrinsic signal (i.e. there is no other Faraday depth at which a purely noise-like 
signal yields a higher polarization amplitude). In this instance the RM synthesis code is constrained 
to derotate the signal at the Faraday depth of the intrinsic polarization signal. 

We compute the probability distribution of the polarization signal subject to the assumption 
that the intrinsic polarization is nonzero and that the RM synthesis code detrotates the signal at 
the Faraday depth associated with the intrinsic signag Suppose we have a nonzero intrinsic signal 



^The contribution of noise, at the Faraday depth of the intrinsic signal may, in principle, slightly alter the depth 
at which the peak amplitude occurs, and hence the Faraday depth reported by the RM synthesis algorithm. This 
may, in turn, influence the statistics of P. However, this is a small effect in most practical situtations. If the Faraday 
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whose Faraday de-rotated channel measurements of the hnear polarization are Vo = {Qo,Uq). 
Associated with each spectral channel there is also a noise contribution. The noise follows a 
Gaussian distribution. It is also uncorrelated with the intrinsic signal and thus its statistical 
properties remain identical after de-rotation at the Faraday depth of the intrinsic signal. We write 
the derotated noise polarization contribution for the ith spectral channel as AQj and AUj. Then 
the polarization vector derived by RM synthesis is 

V = {Qo + Sq,Uo + Su). (20) 

where Sq = N^'^ ^^ AQj and Su = N''^ Yli ^Ui (as defined previously). The polarization vector, 
7^, therefore obeys the statistics of a random walk with a constant offset Vq = {Qo, Uq). Thus the 
polarization amplitude in the presence of an intrinsic signal follows the Rician distribution, 



p,{Z = P\Po) = ^e^p 



2 I d2 



Z' + P, 

2S2 



^o(^), (21) 



where Iq{x) is the modified Bessel function of the first kind of order zero and the subscript s 
denotes that the distribution is valid only when a nonzero polarization signal is present so that the 
signal is derotated at the Faraday depth corresponding to the intrinsic signal, Vq. The associated 
cumulative probability distribution function (i.e. the probability oi Z < P) is 

CDF,(Z|Po) = l-Qi(:|,|), (22) 

where QmicLjb) is Marcum's Q function. This result is analogous to the probability distribution 
of the interferometric fringe amplitude in which one measures a nonzero signal in the presence of 
noise (Thomson, Moran & Swenson 2001, eq. 9.37). 

Some remarks about the treatment of polarization noise bias are in order. The polarization 
amplitude distribution in eq. ([2T]) demonstrates that the measured amplitude is, on average, biased 
by noise. In particular one has, 

(p2^ =p2_^2S2. (23) 

Thus the appropriate noise bias to subtract from a single measurement of P^ derived on the basis 
of RM synthesis is twice the channel- integrated noise variance, S^. While correct on average, bias 
subtraction is clearly inexact for any given source; in as much as the noise can assume a range of 



depth derived by RM synthesis is sufficiently close to the correct depth it will not alter the statistics of P; this is the 
case provided that the S/N is sufficient that the error in Faraday depth is a small fraction of the width of the rotation 
measure transfer function, R{4>) (i.e. the S/N > 2). Winding up an intrinsic polarization signal Po at a depth that is 
wrong by an amount A^ yields an intrinsic signal of amplitude ~ Po cos AqAc/i, so the error in the amplitude of the 
intrinsic polarization vector, ~ Po(AoA<^)^/2, is quadratic in AoA</>. This is a small quantity since, for S/N > 2 the 
peak is localised to within the width of R{(f>), one has AqAi^ <C 1, so that p{Z\Po) ~ p{Z\Po cos Xo4>)- We see in the 
following subsection that the condition S/N > 2 is always satisfied. 
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values, so is there a range of values of intrinsic Pq that are consistent with a given measured P. 
Our analysis shows that it is entirely inappropriate to simply remove a noise bias from the data 
and treat the resulting polarization amplitudes as if they were distributed with a Gaussian error 
distribution. 



4.3. Likelihood of detection 

In the two forgegoing subsections we have derived the probability distributions of the po- 
larization amplitude both when a signal is absent, and when one is present and its amplitude is 
sufficiently large that it dominates the noise (i.e. provided Pq exceeds some threshold). 

Here we deduce the probability distribution of the polarization amplitude that is valid for all 
Pq . The key to this derivation is that there is a nonzero probability that the value of P drawn from 
the signal distribution ps is lower than the value that would be drawn from the noise distribution 
Pn- Under this circumstance, the value of P picked by the RM synthesis algorithm is the greater 
of the two values, namely that drawn from the noise distribution. 

The probability of obtaining a polarization amplitude of Z = P is the sum of (i) the probability 
of drawing Z from the noise distribution and requiring that its value exceeds the amplitude drawn 
from the signal distribution, and (ii) the probability that Z was instead drawn from the signal 
distribution and requiring its value exceeds the amplitude drawn from the noise distribution. Thus, 
one obtains the combined probability distribution, 

Pc{Z = P\Po)=pn{Z)CBFsiZ\Po)+psiZ\Po)CDFniZ). (24) 

This distribution is valid for any value of Pq. It obeys the expected property that at low Z and 
Pq the distribution is dominated by the noise distribution, pn{Z), because CDF„(Z) is almost zero 
in this region, while it is dominated by Ps{Z\Pq) at high Z because Pn{Z) is almost zero in this 
domain. 

By way of illustration. Fig. [5] compares the combined probability distribution for some specific 
choices of Pq against ps and p„ . 

One can extend the foregoing logic to determine the likelihood that a signal found by RM 
synthesis at a given amplitude, Z, is a genuine signal. The probability that the signal has been 
derotated to the correct value of (p is, 

Ps{Z\Po)CBF^{Z) 
P<t> correct (^ I -H)) - ^ (7\-p\ • ^^^> 

It is apparent that, even for small values of Z and Pq, there is a nonzero probability that the RM 
synthesis code deduces the correct Faraday depth of the signal. 

This is borne out by the Cen A data: close inspection of the lower panel in Fig.[T] indicates 
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that even at S/N ~ 3.5 the RM synthesis algorithm preferentially finds signals within the range of 
physically acceptable values of (/>. 



4.4. Deduction of unbiased polarization amplitude source counts 

The foregoing formalism provides an obvious prescription to determine the distribution of 
intrinsic polarization amplitudes, N{Pq) based on the measured distribution of polarization ampli- 
tudes as deduced by RM synthesis, namely N{P): 

N{P) = I dPo p{P\Po)N{Po). (26) 

The solution of eq. (|26p to find N{Pq) is formally an ill-posed problem, but is it of a form that is 
often encountered in numerical optimization theory. 

It is instructive to demonstrate how the solution of eq. (|26p is readily ammenable to numerical 
solution. The distribution of intrinsic polarization amplitudes, N{Pq) must be solved by sampling 
onto a grid with resolution, say, A|P|. We represent this distribution by the vector yj. The 
corresponding distribution of measured polarization amplitudes is discretized onto a grid in P with 
identical resolution, and this is represented by the vector Xj. If one similarly discretizes pdZ \ Pq) 
into a matrix whose i,jth. element corresponds to the parameters Zi and Pqj, with resolution AP 
in both dimensions, the integral in eq. (|26p can be cast in the forin. 



Xi = J2^ijyj^P- (27) 

j 

This class of problems is ammenable to solution by iterative deconvolution and expectation-maximization 
algorithms. 

A simpler but more robust approach is to forward- model the expected distribution A^(l'Pol) to 
determine whether it is consistent with the observed amplitude distribution. 



4.5. Application to Centaurus A data 

As an illustration of the foregoing formalism, we apply it to determine the distribution of 
intrinsic polarization amplitudes in the Cen A dataset. 

In order to model the effect of noise in this dataset precisely, we determined the noise dis- 
tribution pn empirically based on the specific spectral coverage of the ATCA observations. The 
same RM synthesis code used to process the Cen A data was employed. We synthesised 50, 000 
purely noise-like sources where, for each source, synthetic Q and U data for each spectral channel 
were drawn from a normal distribution of zero mean and unit standard deviation, and assigned to 
spectral channels with frequencies identical to those used in the Cen A observations. 
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For each source the "signal" was extracted at the value of (p at which the amplitude in the 
Faraday dispersion spectrum was a maximum. The statistics of the polarization amplitude at 
a fixed Faraday depth, (p = 0, were also recorded. The resulting probability distributions are 
shown in Fig.[6l There is excellent agreement between the distribution for the i;^ = case and the 
Rayleigh distribution discussed in ^4.1.11 The lower panel of Fig. [6] shows the amplitude distribution 
determined from the synthetic noise dataset when the RM synthesis algorithm is allowed to find 
a source anywhere over the Faraday depth range, [—4000, +4000] radm~^. One expects naively, 
on the basis of the width of the RMTF and the range of the search in Faraday depth, that there 
are N ^ 29 independent Faraday depth trials over the search space. There is good agreement in 
the shape of the theoretically- and empirically-derived distributions for this value of A^. However, 
it is evident that the theoretical distribution peaks at a lower S/N than the empirically-derived 
distribution. This cannot be accounted in terms of a change in N since, although this would shift 
the peak of the distribution to higher S/N, it would also narrow the distribution to a value lower 
than is observed. We note that, from a purely empirical viewpoint, the two distributions agree 
closely if the S/N axis is scaled such that the recovered S/N values from the simulated sources are 
systematically higher than those expected in the theoretical distribution by 8.5%. The distribution 
corresponding to this is also shown in Fig.[6l However, the reason for such a S/N discrepancy is 
not obvious; the lower right panel of Fig.H] shows that there is no systematic bias in the S/N of 
sources recovered by the RM synthesis technique and a brute-force fit to the polarization data. 

One can directly compare the polarization amplitude distribution for the sources in the Cen 
A field with "false" RMs (see §3.ip against the distribution, p„ as determined by the synthetic 
pure-noise sources. This comparison is shown in Fig. [71 In this plot we see excellent agreement 
between the polarization amplitude distribution of sources with false RMs and the expected noise 
distribution. However, there is an excess of sources in the region 4<S/N<5 which deserves further 
investigation. A likely explanation for this excess is that a number of these sources are mistakenly 
identified as false detections. It is possible that several contain real signal, but their S/N is suffi- 
ciently low that their Faraday depth is poorly localized by RM synthesis (i.e. they are only localized 
to about the ~ 280radm~^ width of the RM transfer function), thus placing the detection RM 
outside the nominal range [— 210, 110] radm~^ considered to constitute a "good" detection. The 
excess of sources can be seen in Fig. [2] as the concentration of sources in the 4 < S/N < 5 bin with 
RMs in the range [0, 500] radm^^. The results of §4.21 (and Fig. [5]) indicate that when some small 
signal is present the polarization amplitude distribution shifts to higher S/N. This suggests that 
some sources with supposedly "false" RMs do indeed contain intrinsic signal. 

A related statistic is p . correct ' ^^^ expected fraction of sources whose Faraday depths are iden- 
tified correctly as a function of P for a given Pq. We compute this quantity using the numerically- 
derived distribution, pc- Figure [8] displays this likelihood. This gives a direct estimate that the 
probability is correct at any given RM. At S/N w 3 the probability of retrieving the correct Faraday 
depth is Ri 50%, and it approaches 100% for S/N values between 5 and 6, depending on the value 
of the intrinsic polarization value, Pq, 
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It is interesting to compare the differential source counts from the Cen A data in Tabled] 
against this curve, despite uncertainty in the intrinsic polarization amplitudes that contribute to 
the Cen A data. At S/N values of 3, 4 and 5, the measured fractions of good RMs are 26%, 57% 
and 92% respectively, while the predicted fractions at these S/N values are in the ranges 25-57%, 
61-93% and 94%-100% respectively. In estimating the range of fractions, we have assumed that the 
S/N of the intrinsic polarization, Pq, is between 2.5 and 5. 



4-5.1. Forward Modelling of Polarization Source Counts 

It is beyond the scope of this paper to implement a complete iterative deconvolution scheme to 
recover the intrinsic polarization source counts, N{Pq), using eq. (|26p in conjunction with pdZ \ Pq). 

We have instead developed a simple iterative forward-modelling scheme in which the intrinsic 
polarized source count distribution is taken to be of the form, 

' "' \ (Po/Ef, Po/J:<C, ^ ' 

which is a broken power law distribution with index ai < that turns over at a S/N of C and 
scales with an index —4 < a2 < 4 below this S/N break, and where A is an overall normalizatioru. 
This distribution was gridded onto the vector y with a resolution in AP/T, = 0.1 and substituted 
into eq. (I27p to find the predicted distribution of measured P values, which we label Xpred- This is 
compared against the actual distribution vector Xmeas by computing the reduced-x^ of the fit, 

1 ^bins |„ _^ _„ 12 

2 _ -*- V^ l-x-pred,; -X-measJ .„„^ 

^ - iVbins - 4 ^ af ■ ^^^^ 

where we take the errors to be af = Xmeas, (and set ctj = 1 when Xmeas, = 0). The quantity x^ 
was searched for a minimum in terms of the parameters A, C, ai and 02- The minimum occurs 
at x^ = 0.85 for {C,ai,a2,A) = (3.72,-2.17,-0.30,2.60 x 10"^), and the best-fitting solution is 
plotted over the measured polarization distribution in Figure [9l However, the minimum formally 
supports a range of indices in the range —2.9 < ai < —1.0, and we note that the values of ai and 
02 and C are all highly correlated. To illustrate this. Table [2] below lists the derived fit parameters 
for a range in the break S/N, C. 

The best-fitting index of the slope of the differential counts in P at high S/N, ai = —2.17, 
is very close to the value of —2.2 obtained by fitting just the stronger well-detected sources in the 
sample with P in the range 0.6 to 6mJy. This indicates that the source counts in P continue with 



^Since the distribution p{P \ Pq) is zero for Po/S < 2, the fact that N{Po) diverges for small Pq when 0:2 < does 
not pose a problem for our modelling. We are effectively insensitive to the behaviour of the distribution at Po/S < 2. 
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c 


x' 


ai 


02 




4 


0.0 


1.12 


-0.96 


- 


119 


1.0 


0.93 


-1.53 


4 


1.56 


xl03 


2.0 


0.88 


-1.90 


1.01 


8.09 


xlO^ 


3.0 


0.86 


-2.04 


-0.05 


1.48 


xlO^ 


3.72 


0.85 


-2.17 


-0.30 


2.60 


xlO^ 


4.0 


0.85 


-2.22 


-0.37 


3.27 


xlO^ 


5.0 


0.87 


-2.40 


-0.54 


7.39 


xlO^ 


6.0 


0.91 


-2.54 


-0.66 


1.45 


xlO^ 


7.0 


0.95 


-2.64 


-0.74 


2.50 


xlO^ 



Table 2: The fit quality and associated fit parameters as a function of location in the spectral 
break, C. This table illustrates the covariance between the break S/N and the spectral indices of 
the polarization amplitude count distribution. 

the same slope down to at 0.35 mJy, and possibly lower. Note that this forward modelling method 
does not require any assumptions about polarization amplitude bias at low S/N. 

Following the technique used by Tucci et al. (2004) we can now estimate the change in fractional 
polarization as a function of flux density by comparing the source counts in I and P at the same 
source density. For both FIRST (White et al. 1997) and these Centaurus A field sources the 
differential counts in / have a slope of —1.8 in this flux range. The fractional polarization is 
3.0% for the strongest sources (> 200 mJy) and this increases to 3.5% for sources below 20 mJy. 
This can be compared with the mean fractional polarization of ATLSB sources {Sy > 0.4 mJy) of 
4.3%. Taylor et al. (2007) make a detailed analysis of polarization in the Elais Nl field. They have 
786 sources with 83 polarization detections. We have 1005 sources and at least 346 polarization 
detections (SS) in the same flux density range. We agree well in the source density and polarized 
fraction at the higher flux density but our fractional polarization for sources below 20mJy (3.7%) 
is less than the increase to 4.8% seen by Taylor et al. (2007). 



5. Sources with complex polarization 

To evaluate the reality of sources which have multiple RMs we evaluated the reduced x^ 
after KM synthesis identified each new CLEAN componenl|f|, and used this to help determine the 
significance of each new component. To evaluate the success of this process and to see what type 
of source has high reduced x^ "^6 looked at the 21 sources with reduced x^ > 2.4 and S/N > 5. We 



^Recall that CLEAN recognised all components that were within 5 rad m ^ of each other as being part of the same 
component. 
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can divide these sources into four categories: 

1. Some residual bad data points were readily identified in this process and these (6) sources do 
not appear in the list when the bad data is flagged out. 

2. High S/N (3 sources). The high ^ associated with some high S/N sources could be due to 
real but weak components with different RM which are only visible at high S/N. It may also 
result from instrumental dynamic range errors. 

3. Multiple RM (8 sources). These are well fit by multiple components. Note that for sources 
with multiple RMs we are unable to uniquely separate components within our RM resolution 
of ~280radm-2. 

4. Significant spectral change in P across the band (4 sources). This is likely to be caused 
by regions with closely spaced RM and polarization vectors as discussed by Farnsworth et 
al. (2011), although it may in principle be caused by depolarization effects. 



Table 3: The fit parameters for the 


source 131943-445904, as shown in Fig.IH 


Component no. RM (radm~^) 


reduced x^ after inclusion Pq (mJy) S/N 


1 -120.4 

2 171.2 

3 -323.2 


4.5 1.41 15.0 

2.6 0.94 10.0 
1.6 0.54 5.8 


Table 4: The fit parameters for the 


source 131713-410934, as shown in Fig.HH 


Component no. RM (radm^^) 


reduced x^ after inclusion Pq (mJy) S/N 


1 -74.8 

2 118.4 

3 -686.8 


2.7 13.1 66.8 
1.7 0.56 2.9 
1.1 0.45 2.3 



We illustrate these effects with two sources from this list: 

131943-445904 has three significant and comparable amplitude RM components which are just 
separated at our RM resolution. The three components are a good fit and reduce y^ from 4.5 to 
1.6. See FigurefTOl and Table [3l 

131713-410934 is a high S/N case with a dominant RM components at — 75radm^^. It also 
has a barely resolved component at 118radm~^ which causes the slope in P and a much weaker 
(3%) well resolved component at RM= — 687radm^^. See FigurefTTJ and Table HI 

It is beyond the scope of this paper to present a detailed discussion of the sources with complex 
RM structure but we can make a few useful observations. Sources with RM structure less than 
our resolution (280 radm^^) can still cause strong variations in polarization amplitude across the 
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band and the RM clean process can fit this change in amplitude with components separated by 
approximately the HPBW. While this is still a valid indication of significant RM structure, the 
"super resolution" of our CLEAN process (i.e. the ability to resolve RM components within the 
width of the RMTF) is unlikely to find a unique solution for the actual RM components. 

Although we do see clear evidence of multiple RMs in 12 of the 359 sources with S/N in 
polarization amplitude greater than 5, this is only 3% of our sample and the majority of sources 
are well fit by a single RM component. 

To better quantify this fraction we can ask down to what S/N one can we detect a second RM 
component by examining to what degree the addition of the second component improves the fit to 
the spectropolarimetric data. We start with a model consisting of one polarized component Mi 
and compare it against the measurement vector m. Let us assume that the measurements in each 
spectral channel have the same error, a. Then, if the vector m contains A'^chan components, one has 

X? = 7^Tr^— Jv;2(Mi-m)2. (30) 

(2iVchan - o)(T 

Now let us introduce a second polarized component, M2 that improves the fit and produces a new 
reduced x^ of 

^2 = (n.r ^ a^ 2 (^1 + M2 - m)2. (31) 

(2A'chan - 6)0-^ 

Subtracting the two foregoing expressions and, if the data are well explained by two polarized 
components, one has Mi — m ~ M2 we find, 

S2[x^(2Ar,han - 6) - X?(2Arehan " 3)] « 3 1 M2 | VA^chan , (32) 



where we use the fact that S = cr/^/Nchan- Recognising that IM2I /A'chan is the average squared 
amplitude of the second polarized component, say P2, we find, 

^ « ^[xi(2Archan - 6) - X?(2Arehan " 3)] (33) 

For our measurements, one has A^chan = 22 and we can see that, to improve the fit from Xi = 2.4 
to 1.3, one reqires a component with a S/N of P2/S ~ 3.6. 

We note that the statistical treatment for the detection of a second RM component in the 
presence of another strongly polarised signal is very different from the analysis of the detection of 
a polarised signal in the presence of noise. The perturbation that an additional polarised signal 
generates will be closer to the normal detection statistics so the weak multiple components we find 
at the S/N > 3 level are already significant. 



6. Conclusions 

Our major conclusions are as follows: 



22 



We have assessed the distribution of false and correct RM detections in the 1005-source sample 
of Feain et al. (2009). This yields a quantitative estimate of the likelihood that a given RM, as 
found by RM synthesis, is correct, as a function of S/N. If 4 <S/N< 5, we find the likelihood 
of finding a correct RM is about 52%. Restricting the range of possible RM solutions to the 
range -210 to llOradm"^ (the range found for the high S/N sources) further decreases the 
likelihood of a false RM detection; there is only a 4% chance that any given Faraday depth 
found by RM synthesis in a search over the range [—4000, 4000] radm~^ will fall in the range 
of expected real detections by accident. 

There is no systematic difference in the polarization amplitude recovered by RM synthesis 
compared to a least-squares fitting approach as a function of S/N, and the average ratio of 
the amplitudes found by the two methods is 1.0. 

We have investigated the effect of noise in RM synthesis and examined the probability dis- 
tribution of the polarization amplitude as determined by this algorithm. From this, we have 
developed a formalism to recover the distribution of intrinsic polarization amplitudes from a 
measured distribution which is affected by noise. 

This enables us to recover the polarization distribution at S/N levels well below the confidence 
level for any single detection. We have applied this to examine the polarization counts of the 
sources in the Feain et al. (2009) Cen A catalog; the best fitting polarization source count 
distribution follows N oc P^2.35^ which is consistent with N oc p^^.ie found by fitting only 
to S/N> 7 detections, and with no turnover at an amplitude above 0.2 mJy. 

A related point is that polarization "bias" should not be subtracted from the derived polar- 
ization amplitudes when analysing the distribution of P. Subtraction of this bias is often at 
best misleading, since the distribution of P is highly non-Gaussian. 

Pre-existing analyses have used a S/N cutoff of ~ 7 as a limit to the believability of results 
from RM synthesis. We derive here the likelihood that a given RM detection is correct as a 
function of S/N and the intrinsic polarization amplitude (Pq) of the polarization signal. We 
also derive the criteria for the likelihood of the detection of a second RM component. For 
instance, a detection at a S/N ~ 3 is sufficient to believe the existence of a second component 
if the addition of the second component reduces the x^ value of the fit to the polarization 
data from ~ 2.4 to ~ 1.3. 



The Australia Telescope Compact Array is part of the Australia Telescope which is funded by 
the Commonwealth of Australia for operation as a National Facility managed by CSIRO. 

Facilities: ATCA 
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Fig. 1. — Scatter plots showing the distribution of Faraday depths as a function of the signal-to- 
noise ratio of the polarization detection. The top panel shows the entire dataset while the lower 
panel provides a more detailed plot for S/N< 8. 
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Fig. 2. — Histograms showing the distribution of Faraday depths as a function of the signal-to- 
noise ratio of the polarization detection. The logarithmic scaling on the y-axis accentuates bins 
populated by only a single source. 
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Fig. 3. — Top panel: The distribution of rotation measures derived on the basis of least-squares fits 
to the polarization position angle measured in each channel. Bottom panel: the RM distribution 
of the same sources as determined by RM synthesis using the channelized data. 
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Fig. 4. — Comparison of the RM and polarization amplitude derived by RM synthesis compared 
to a brute force least squares fit to the polarization data, as described in §3.31 
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Fig. 5. — An illustration of the various distributions ps, Pn and pc as a function of P for A^ = 20 
and (top) \Vo\ = 1.5/S and (bottom) |Po|/S = 3.5. For values of \Vo\ < 1 the distribution closely 
resembles the noise distribution, pn, while for iT^ol/S > 1 it closely resembles the signal probability 
distribution, pg. 
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Fig. 6. — The amplitude of the polarization vector recovered by RM synthesis from a purely 
noise-like signal for the spectral channel setup used in the Cen A observations. The upper panel 
depicts the distribution when the rotation measure is constrained to a fixed value, here = 0. 
The overplotted curve (red) shows the predicted distribution (see ^4.1.ip . The lower panel de- 
picts the distribution of recovered polarization amplitudes when the rotation measure is allowed 
to fall anywhere in the range allowed for the polarization extracted from the real data, namely 
— 4000radm~ < (p < 4000radm~^. The overplotted curves show the theoretically expected distri- 
bution for the expected number of degrees of freedom, N = 29, using eq.[lHl The red curve shows 
the expected distribution, while the green curve shows the distribution that would be derived if the 
S/N values derived by RM synthesis were 8.5% higher than expected (see text for discussion). 
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Fig. 7. — A comparison of the probability distribution of polarization amplitudes found in the Cen 
A data which are regarded as having "false" RM identifications with the distribution expected of 
a purely noise-like signal based on a simulation of 5 x 10^ synthetic sources. The error bars on the 
Cen A source counts reflect the fact that the distribution is estimated from a finite (and small) 
number of sources, and scale as the square-root of the number of sources per bin. 
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Fig. 8. — The probability that RM synthesis correctly identifies the Faraday depth of the polarized 
signal as a function of the measured polarization amplitude, P. The curve is calculated for intrinsic 
signals of IPqI/^ = 3,4 and 5. The small oscillations in the curves are due to the fact that the 
noise distribution, pc, is not computed from an analytic expression, but from the histogram derived 
from the simulation of 5 x 10^ purely noise-like sources. 
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Fig. 9. — The observed source counts in P (cyan) and the best fitting model of it (black). 
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Fig. 10. — The 3-component polarization solution for the source identified in Feain et al. (2009) 
as 131943—445904. The fit parameters for each component are shown in Table [3l including the 
reduced x^ value after each successive component is added to the model. 
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Fig. 11. — The 3-component polarization solution for the source identified in Feain et al. (2009) 
as 131713—410934. The fit parameters for each component are shown in Table HI including the 
reduced x^ value after each successive component is added to the model. 



